AITopics | tuoma haarnoja

Collaborating Authors

tuoma haarnoja

Maximum Entropy Monte-Carlo Planning

Chenjun Xiao, Ruitong Huang, Jincheng Mei, Dale Schuurmans, Martin Müller

Neural Information Processing Systems | Feb-12-2026, 17:47:59 GMT

The idea is to augment Monte-Carlo Tree Search (MCTS) with maximum entropy policy optimization, evaluating each search node by softmax values back-propagated from simulation.

artificial intelligence, machine learning, sft, (17 more...)

Neural Information Processing Systems

Country: North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.96)
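
The "softmax values" mentioned in the Maximum Entropy Monte-Carlo Planning snippet above can be illustrated with a small sketch. This is not the authors' implementation: it assumes the common maximum entropy definition, in which a node's value is the temperature-scaled log-sum-exp of its children's action values, and the names `softmax_value`, `softmax_policy`, and `tau` are hypothetical, chosen here for illustration only.

```python
import math


def softmax_value(q_values, tau=1.0):
    """Return tau * log(sum(exp(q / tau))) over the child action values.

    Assumed definition of a maximum entropy "softmax value"; the maximum is
    subtracted before exponentiating to keep the computation stable.
    """
    if tau <= 0:
        raise ValueError("tau must be positive")
    m = max(q_values)
    return m + tau * math.log(sum(math.exp((q - m) / tau) for q in q_values))


def softmax_policy(q_values, tau=1.0):
    """Return action probabilities exp((q - V) / tau), where V is the softmax value."""
    v = softmax_value(q_values, tau)
    return [math.exp((q - v) / tau) for q in q_values]


# Hypothetical child estimates at one search node.
child_q = [1.0, 2.0, 3.0]
node_value = softmax_value(child_q, tau=0.5)
node_policy = softmax_policy(child_q, tau=0.5)
```

Under this assumed definition, a small `tau` pushes the value toward the plain maximum over the children, while a larger `tau` spreads the policy more evenly across actions.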

Residual Pathway Priors for Soft Equivariance Constraints

Neural Information Processing Systems | Feb-12-2026, 00:58:54 GMT

A scene may have long-range non-local interactions, rotation equivariance may be violated by a preferred camera angle, or a dynamical system may occasionally have discontinuous transitions.

artificial intelligence, arXiv preprint, machine learning, (16 more...)

Neural Information Processing Systems

Country: North America > United States > California > San Diego County > San Diego (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.95)

A Regularized Approach to Sparse Optimal Policy in Reinforcement Learning

Wenhao Yang, Xiang Li, Zhihua Zhang

Neural Information Processing Systems | Feb-12-2026, 00:07:26 GMT

Even if the optimal policy is obtained precisely, it is often the case that the optimal policy is deterministic.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country:

Asia > China > Beijing > Beijing (0.05)
North America > United States > New Jersey > Mercer County > Princeton (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.67)

a3bf6e4db673b6449c2f7d13ee6ec9c0-AuthorFeedback.pdf

Neural Information Processing Systems | Feb-9-2026, 15:47:39 GMT

deep reinforcement, gradient, tuoma haarnoja, (10 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Discovered Policy Optimisation

Neural Information Processing Systems | Feb-9-2026, 13:45:34 GMT

Most of these advancements came through the continual development of new algorithms, which were designed using a combination of mathematical derivations, intuitions, and experimentation. Such an approach of creating algorithms manually is limited by human understanding and ingenuity.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country: North America > United States (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.48)

a3bf6e4db673b6449c2f7d13ee6ec9c0-AuthorFeedback.pdf

Neural Information Processing Systems | Aug-15-2025, 13:49:07 GMT

artificial intelligence, gradient, machine learning, (13 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

vitchyr/rlkit

#artificialintelligence | Dec-23-2019, 07:59:15 GMT

Reinforcement learning framework and algorithms implemented in PyTorch. To get started, check out the example scripts, linked above. The initial release for 0.2 has the following major changes: Overall, the refactors are intended to make the code more modular and readable than the previous versions. These Anaconda environments use MuJoCo 1.5 and gym 0.10.5. You'll need to get your own MuJoCo key if you want to use MuJoCo.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

#artificialintelligence

Industry: Information Technology > Services (0.32)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.36)

Soft actor critic – Deep reinforcement learning with real-world robots

Robohub | Jan-21-2019, 16:11:57 GMT

We are announcing the release of our state-of-the-art off-policy model-free reinforcement learning algorithm, soft actor-critic (SAC). This algorithm has been developed jointly at UC Berkeley and Google Brain, and we have been using it internally for our robotics experiments. Soft actor-critic is, to our knowledge, one of the most efficient model-free algorithms available today, making it especially well-suited for real-world robotic learning. We also release our implementation of SAC, which is particularly designed for real-world robotic systems. What makes an ideal deep RL algorithm for real-world systems?

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Robohub

Industry: Education (0.37)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
